Closed-Loop Dynamic Control of a Soft Manipulator Using Deep Reinforcement Learning

Abstract

The focus of the research community in the soft robotics field has been on developing innovative materials, but the design of control strategies applicable to these platforms is still an open challenge. This is due to their highly nonlinear dynamics, which are difficult to model, and the degree of stochasticity they often incorporate. Data-driven controllers based on neural networks have recently been explored as a viable solution to be employed for soft manipulators. This letter presents a neural-network-based closed-loop controller, trained by a deep reinforcement learning algorithm called Trust Region Policy Optimization (TRPO). The training takes place in simulation, using an approximation of the robot's forward dynamic model obtained with a Long Short-Term Memory (LSTM) network. The controller allows following different paths executed at different velocities in the workspace of the robot. The results demonstrate that the controller is effective under normal working conditions and with a payload attached to the end-effector of the manipulator.
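The closed-loop arrangement described in the abstract, where a policy acts on a state predicted by a forward model and the prediction is fed back at the next step, can be illustrated with a minimal Python sketch. Everything here is an assumption made for illustration: the linear `surrogate_forward_model` stands in for the LSTM forward model, the hand-written `proportional_policy` stands in for the TRPO-trained network, and the gains, waypoints, and step counts are hypothetical values, not taken from the paper.

```python
# Hypothetical stand-ins: the paper's LSTM forward model and TRPO-trained
# policy are replaced here by a linear surrogate and a proportional rule.

def surrogate_forward_model(position, action, gain=0.5):
    """Predict the next tip position from the current position and action.

    'gain' is an assumed constant, not a value from the paper.
    """
    return [p + gain * a for p, a in zip(position, action)]


def proportional_policy(position, target, k=1.0):
    """Closed-loop policy: the action depends on the current tracking error."""
    return [k * (t - p) for p, t in zip(position, target)]


def rollout(path, start, steps_per_waypoint=5):
    """Track each waypoint in turn, feeding the predicted state back to the policy.

    Fewer steps per waypoint corresponds to a faster traversal of the path.
    Returns the final position and the negative accumulated tracking error,
    which plays the role of an episode return.
    """
    position = list(start)
    episode_return = 0.0
    for target in path:
        for _ in range(steps_per_waypoint):
            action = proportional_policy(position, target)
            position = surrogate_forward_model(position, action)
            error = sum((t - p) ** 2 for p, t in zip(position, target)) ** 0.5
            episode_return -= error
    return position, episode_return


if __name__ == "__main__":
    path = [(0.1, 0.0), (0.1, 0.1), (0.0, 0.1)]  # hypothetical 2-D waypoints
    final_position, episode_return = rollout(path, start=(0.0, 0.0))
    print(final_position, episode_return)
```

In a reinforcement learning setting, a quantity like `episode_return` is what the training algorithm would try to increase by adjusting the policy parameters; this sketch only evaluates a fixed policy and performs no learning.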

Similar resources

Closed-loop manipulator control using quaternion feedback

Some operational details of the zero reference position method are presented in the context of deriving kinematical equations for a robot with a nonspherical continuous roll wrist.

Closed-Loop Object Recognition Using Reinforcement Learning

Current computer vision systems whose basic methodology is open-loop or filter type typically use image segmentation followed by object recognition algorithms. These systems are not robust for most real-world applications. In contrast, the system presented here achieves robust performance by using reinforcement learning to induce a mapping from input images to corresponding segmentation paramet...

Reinforcement Learning for Mixed Open-loop and Closed-loop Control

Closed-loop control relies on sensory feedback that is usually assumed to be free. But if sensing incurs a cost, it may be cost-effective to take sequences of actions in open-loop mode. We describe a reinforcement learning algorithm that learns to combine open-loop and closed-loop control when sensing incurs a cost. Although we assume reliable sensors, use of open-loop control means that action...

Dynamic and Closed-Loop Control

3 Classical closed-loop control; 3.1 PID feedback; 3.2 Transfer functions; 3.3 Closed-loop stability; 3.4 Gain and phase margins and robustness; 3.5 Sensitivity function and fundamental...

Reinforcement Learning for Multi-Linked Manipulator Control

We present an automatic trajectory planning and obstacle avoidance method for a multi-linked manipulator which uses position and velocity sensor information directly to produce the appropriate real-valued torques for each joint. The inputs are fed into a Cerebellar Model Arithmetic Computer (CMAC) [1] and in each state, the expected reward and torques for each joint are learnt through self-expe...

Journal

Journal title: IEEE Robotics and Automation Letters

Year: 2022

ISSN: 2377-3766

DOI: https://doi.org/10.1109/lra.2022.3146903